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' Abstract 

^3 ' This is my contribution to Proceedings of the International Workshop on Cosmic Structure and 

qq , Evolution, September 23-25, 2009, Bielefeld , Germany. In my talk I presented some non-Gaussian 

■ features of the foreground reduced WMAP five year full sky temperature maps, which were recently 
reported in the Ref. [I]. And in these notes I first discuss the statistics behind this analysis in some 
detail. Then I describe invaluable insights which I got from discussions after my talk on the Workshop. 
And finally I explain why, in my current opinion, the signal detected in the Ref. [I] can hardly have 
something to do with cosmological perturbations, but rather it presents a fancy measurement of the 
Milky Way angular width in the microwave frequency range. 
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Nowadays we witness a great progress in both theoretical and observational cosmology which makes our 
demands and expectations ever higher and turns us to discussing more and more subtle properties of the 
available data. One of such popular topics is the quest for primordial non-Gaussianities in the spectrum of 
the CMB radiation. Not really expected to be detectable for the simplest models of inflation, these small 
jy-^ ■ departures from the purely Gaussian signal would help to distinguish between more elaborate inflationary 

scenarios and would probably provide us with some new insights into the wonderful realm of the very 
early Universe. 

| The approach I discuss is based on a very simple idea. Assume that the Universe is statistically 

isotropic, and all the temperature fluctuations in the CMB radiation are of statistical nature. Then we 
decompose the fluctuations, as usual, into the spherical harmonics denoting the coefficients by a^ m and 
get 

rS ' ( a i,ma*i m i) — (a;,ma/',-m') = Ci5i t i'5 m m > . 

b . 

Moreover, a; !m 's with a fixed value of I but different values of m can be thought of as different realizations 
of one and the same random variable. So that one can naturally ask a question about the shape of 
the distribution. It can be answered by many methods, from plotting a histogram of the sample to 
estimating the higher moments of the distribution. A Gaussian distribution is completely determined 
by two parameters, its mean and its variance. For fluctuations the mean is taken to be 0, and the only 
parameter left is the variance, C;. (Of course, it implies rescaled ^-distributions for quadratic in a 
quantities.) If the random variable is known (or assumed) to be Gaussian, this single parameter can, in 
principle, be extracted from any part of the probability distribution. The idea of the Ref. [Tj is to take 
only the tails, i.e. to deduce the variance C; once more from only the distribution of large coefficients, 
|a;,m| 2 > Ci, and then to compare this result with the original one. Up to statistical variations in the 
number of data points in the tails, this corresponds to using the order statistics with somewhat more 
points than in top and bottom sextiles but with less points than in two marginal quintiles. 

Applied to the full sky foreground reduced maps, this method gives a well-pronounced peak in the 
difference between the two estimates for the variance. The peak is located in the range of I ~ 45 ± 15. 
The fluctuations outside of the peak are also much larger than would be expected statistically. This is, 
of course, due to remaining foregrounds contamination, and the only reason to take the peak seriously 
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was that it is a few more times larger than the other fluctuations pQ and it looks more or less the same 
in different frequency bands (up to the different overall level of noise). And this was also my conclusion 
that it should have something to do with cosmology. I explain the relevant statistics in Sections 2 and 
3. And for the graphical presentation of results, I refer the reader to [1J. However, at the Workshop 
I have learned from Pavel Naselsky that the multipole coefficients with even values of I + m have the 
worst contamination from Galaxy, see below. In Section 4 I discuss the data analysis with separation 
of (I + 7n)-even and (I + m)-odd harmonics, and show that the initial assumption of having different 
realizations of one random variable is heavily disproved due to Galactic signal which invalidates the claim 
for cosmological non-Gaussianities. My current conclusion presented in the Section 5 is that the effects 
observed in pQ refer to the structure of Galactic noise, and not to properties of primordial fluctuations. 
Due to this reason I cancelled my authorship for the second version of that article. (I was a co-author for 
the first one). And I would like to note here that all the computer work for the article pQ was done by my 
former co-author Vitaly Vanchurin and, needless to say, if there appears to be something primordial about 
this peak then the whole success should be attributed solely to him and his enthusiasm. An interested 
reader may also want to consult with the original reference pQ for the opinion opposite to mine. 



2 The method and possible variations 

The original approach was to consider a/. m 's with to > and fixed I as a sample of I observed values of 
a complex random variable with the Gaussian probability density of variance a 2 = C'i : 



p{z)dzdz* 
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Then a function j\ , defined by 
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where 9 is the step function, has an expectation value equal to the variance. Indeed, 
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zz*9(zz* — a 2 ) ■ exp 



dzdz* = 



ye y dy = —(y + l)e y 



Statistically, given a sample of N data points (in our case N — I), one evaluates the quantity 
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and compares it to a 2 . Note that in this Section I ignore the fact that we can do nothing but use the 
sample variance in {1}. I'll come to it later. 

In order to have more data points we need to resort to real variables because |<z;, m | = |a;,_ m |. In this 
case I would assume that there are N = 21 + 1 observations, namely \ai^\, V2\diai i7n \ and v^l^a/,™! for 
to > 0, of a real random variable, in which case the probability density is 



p{x)dx 
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and the same analysis carries over for another observable 
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h{x) 



1 — erf 



x 2 9(x 2 -a 2 ) 
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with the error function defined by erf (i) = J* e x dx. One can prove prove that (fa) = a 2 using the 
following simple analytic trick: 
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All of this can be done in a variety of ways. For example, one can work with order statistics which 
I already mentioned in the Introduction. Another possible idea is to average only over N* data points 
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which are larger than the standard deviation without summing the zeros for excluded entries as it should 
be done according to |T]) or The mean number of remaining observations is given by (N*) = 

N f 9(\x\ 2 - a 2 )p(x)dx. 

In the complex case we have (N*) = — and transform fi into a new estimator 

= 1 J2 z * z t ■OizjZ* -a 2 ) 

if 3i : ZiZ* ^ a 2 and f3 = a 2 otherwise. The latter case has only a tiny probability with a God-given 
variance a 2 , and it is absolutely impossible if the sample variance is used. Surprisingly, (fa) = a 2 with 
no bias. A simple way to check it is to divide the integration domain into 2 N parts with definite signs of 
all ti — 1 where U = -^t- The integral under consideration is a symmetric function of the variables U 7 
therefore for each k, k ^ N, one can consider myzMi indentical integrals with ti ^ 1 for i ^ k and 
tj ^ 1 for j > k + 1: 
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(h) = h - \\_p{z l )dz i dz. 
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The product symbols denote here the products of J dte~ l operators, and not of the integrands after the 
dot. Note also that if we were to define f3 = when all the observations are below the standard deviation, 
then a tiny bias of 1 part in e N would have been there. 

Finally, for real variables (N*) = ^1 — erf \ ' N, and the corresponding estimator is: 



l-erf ,/i 2 



E^-0(x 2 -a 2 ) 



U = , A v 7 r- ■ H^dr^d (4) 



1 — erf 



2 I ~ V we 



and fi = a if Vi : x 2 < a . The proof that (f4) = u is exactly the same as for f3 but with binomial 
series for (Yl - erf + erf (\/F)) ■ 

3 A statistical interlude 

Up to this point I was using the variance in all estimators as if it were known exactly. In reality the sample 
variance is used, of course. Generically, it should induce some bias. For example, with N observations of 
a random variable x with the mean value fj, one can estimate the variance as Sfe^i^ , it is well-known 
that if the sample average is used for the mean, then this estimator is only asymptotically unbiased, 
that is it has a non-zero bias which tends to zero when N — ► 00. An unbiased estimator is ^trEx ■ It- 
diverges if N = 1 which makes a good sense since nobody can ever measure two parameters with a one 
simple observation. Otherwise, one combination of the data is used to define the mean while the others 
give information about the random deviations. 

I will illustrate the bias only for the estimator f 1 , in which case we have 

e N ( 

f 1 = 2N E z ^ ' 9 z * z * ' 
1=1 \ 

With the variables ti = one gets 

2 N poo N / N \ 2 N roc I I N \ 

&> = ^II / due-** (*i-^E** =f f II / dt l e^-t N e\t N --Y. t A 

z=l " j=l \ fc=l / i=l " \ j=l J 
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In the last equality I have used the fact that everything is symmetric with respect to permutations of U 
variables. Now I introduce a new variable v = Xh=i instead of and obtain the final result: 

2 Tf-l -oo f ( N-l \ 

(fi) = — II I dt t - J dv\v-J2 l A 

It is clear that the estimator is only asymptotically unbiased, (fi) ~ a 2 (l + + O (j^))- In any case, 
we are not expected to do better. There are many other difficulties in extracting the CMB data from 
observations. 

Of course, one can also estimate the standard deviations for the estimated quantities. But note 
that in our case the only meaning of it is the minimal level of fluctuations which should be there if 
the origin of the signal is stochastic. The actual fluctuations are higher. Nevertheless, the standard 
deviation of fi can be found as a/ (ff) — (fi) 2 , and one can easily check that, as usual, it reduces to 
~JnV (/i ) — (/i) 2 = ^^^ c 2 ~ 1.55-^=. But actually, we are interested in the fluctuations of the 
difference between the two estimations of the variance. This difference fluctuates less than the individual 
terms. Indeed, 
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It gives the deviation w 0.63-^= which should be approximately correct because the random excursions 
of a are much less than those of the individual observations. 

For the estimators © and © the standard deviations are not of the form somethi ^a_ smi P le , One 
can find them exploiting the above trick with the integration domains, this time for the integrand 

W (Si=i tij ■ The result would be 

N ati / i\N-k 1412,1, 4 N »ri / i\N-k 



y kl(N-k)l \ e) ' k 2 ~ ° + T ^ kl(N - k)l 



The first term is just (fa) 2 while the second one gives the variance. Asymptotically, only large values 
of k do matter, and we can substitute k by k + 1. Up to the factor of it gives the binomial 

formula again (neglecting the contribution of the first few terms). Hence, the standard deviation is given 

by w ^ e %- ~ 0.8-7??. A little bit more tedious calculation shows that the standard deviation of the 
difference between (fa) and the sample variance has the same asymptotic behaviour. 

Ideally, one should also take the effects of using the sample variance in the step functions into account. 
But it is not of my concern now. For clean maps the main task would have been to test the hypothesis 
that all the deviations from zero difference between two Ci estimations are purely statistical. However, 
we use the noisy maps. And the signal considerably deviates from Gaussianity anyway. The curious 
result was only about the large peak at I w 45 ± 15 which was argued to have a cosmological origin [TJ. 



4 The structure of Galactic contamination and the data 

My talk at the Workshop was followed by a very interesting discussion, and I learned from Pavel Naselsky 
that the az jrn -coefficients with even values of I + m are more contaminated by the Galaxy [2]. The reason 
is very simple to understand examining the Rodrigues formula for the associated Legendre polynomials 
on the interval z 6 [—1,1], 

P r (z)=const.(l-z 2 y^(z 2 -l) 1 . 

The Galactic plane corresponds to z = 0. Therefore, (I + m)-odd polynomials are antisymmetric under 
reflection in Galactic plane [z — > —z) and have roots at z = 0, while the (/ + m)-even ones have local 
extrema at the same place. Note also that I ~ 45 corresponds to the scale of several angular degrees 
which nicely matchs with the apparent width of the Milky Way and with the width of the red central 
stripe on the pictures of the WMAP results. One has to analyse these harmonics separately. And as 
reported in the second version of p], the (I + m)-even coefficients reproduce mostly the same shape of 
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Figure 1: Estimated values of • + ^ Cl in units of fiK 2 for the W band. The red line is obtained 
with all (7 + m)-even harmonics. For the purple line only large (7 + ro)-even coefficients are used. The 
(l + m)-odd multipoles give the blue line, while restricting them to only the large ones results in the green 
line. 

the super-Gaussian peak, while the (7 + m)-odd ones give a small sub-Gaussian valley at the same values 
of I. It already shows that different a; !m 's for the same C; are not at all equivalent as they were assumed 
for the purposes of this analysis. 

I also handled the W band data manually in order to gain a better perception of the numbers. 
Examining the data, one can see the super-Gaussian character of (I + m)-even observations with almost 
a naked eye. There are many points with large values, and many points are considerably smaller than 
the standard deviation. Sometimes just a couple of very large data points makes a significant part of the 
peak. For example, I found that the imaginary part of a^e^e is more than two times larger than even the 
largest of other numbers in the group of d§. 

On the Fig. 1 I present the values of l ^ l+ ^ Cl versus I. Only the left, ascending part of the peak is 
plotted there, as it is less challenging for a manual computation. Unlike in [T], I used the estimator 
The data points are binned into the groups of five, that is the values of the function are given only for 
values of the argument congruent to zero modulo five, and each ordinate is an average of five values, 
those from I — 2 to I + 2. For even harmonics we see the peak, where the red line is the sample variance 
for all even coefficients, while the purple line is obtained with the estimator (f4]). The same is done for 
odd harmonics (the blue and the green lines), and a small valley is revealed. I won't bet for the precise 
ordinates as I was calculating manually and rounding the data a little bit. But the general structure is 
represented correctly and agrees with the claim in the Ref. pQ. We can see that the peak is somewhat 
reduced compared to what was plotted in [TJ without the separation of harmonics, at least in its relative 
weight, And what is more important, the red line goes several times higher than the blue one. 

(And even the blue line is some factor of two higher than the actual primordial radiation [3].) This 
is true not only in average, but also for every single Ci. It clearly shows that the dominant signal for 
these multipoles comes from the Galaxy, and it also heavily disproves the original hypothesis of having 
different observations of a one random variable. If I + m is even, one also gets the large coefficients with 
large m's more frequently than for the middle values of m, as was stated in pQ. This effect is by far less 
pronounced than the difference between the red and the blue lines. But on the other hand, it shows that, 
even after the separation of different parity harmonics, the signal cannot be analysed reliably in this way. 
A considerable part of the initial peak came from the mixing of harmonics with essentially different levels 
of contamination which overweights the central part and the tails of the distribution. The remaining 
(/ + m)-even peak may also be the consequence of a non-uniform contamination, although the Galactic 
signal by itself is not very Gaussian. It could also be an interesting problem to compare the results of real 
and complex random variables analysis. The direction of zero Galactic longitude points at the Galactic 
center, and therefore cosine and sine harmonics might receive different contaminations. 

The peak disappears when we go to larger I and the spherical harmonics start probing the latitude 
scales smaller than the width of the bulk of Galactic signal. And therefore the different parity harmonics 
become not so different in the contamination level. For example, l ^ l ~ i 2 1 J Cl for / = 100 estimated with 
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(I + m)-even harmonics is only few percent larger than estimated with (I + m)-odd ones. (It is about 
8000/i-ftT 2 .) Moreover, |aioo.o| is quite large, and Cioo-even estimated without m — is a bit smaller than 
Cioo-odd. 

It is hard to infer about the origin of the valley without knowing the detailed structure of the Galactic 
noise. It can come from some non-Gaussian properties of the noise. It can reflect some shortcomings 
of the foreground reduction procedures somehow oversubtracting the super-Gaussian noise from less 
affected coefficients. Probably, one could even devise a reasonable mixing of signals which would mimic 
a sub-Gaussian distribution, although it may require some bias of the mean values too. 

5 Conclusions 

In these notes I discussed the statistics behind the non-Gaussian anomalies reported in Ref. PQ. After 
that I have shown that the most probable explanation of the signal refers to geometric properties of the 
Galactic noise, and not to cosmology. Admittedly, I do not have a good understanding of the structure 
of Galactic noise. But at the very least, the claim for cosmological non-Gaussianities is pretty much 
premature. (I refer an interested reader to the work pQ for a different opinion.) On the other hand, one 
could probably use this kind of analysis to extract some information about the structure of the noise. 
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